A Neural Substrate of Prediction and Reward | Science (1997)
WOLFRAM SCHULTZ
,
PETER DAYAN
, AND
P. READ MONTAGUE
https://doi.org/10.1126/science.275.5306.1593
ドーパミン(Dopamine)
予測報酬誤差(reward-prediction error)
Temporal Difference error; TD error